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GAUSS-MARKOV RANDOM FIELDS USING SENSOR NETWORKS 
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ABSTRACT 

The detection of hidden two-dimensional Gauss-Markov random fields us- 
ing sensor networks is considered. Under a conditional autoregressive 
model, the error exponent for the Neyman-Pearson detector satisfying a 
fixed level constraint is obtained using the large deviations principle. For 
a symmetric first order autoregressive model, the error exponent is given 
explicitly in terms of the SNR and an edge dependence factor (field cor- 
relation). The behavior of the error exponent as a function of correlation 
strength is seen to divide into two regions depending on the value of the 
SNR. At high SNR, uncon'elated observations maximize the eiTor exponent 
for a given SNR, whereas there is non-zero optimal correlation at low SNR. 
Based on the error exponent, the energy efficiency (defined as the ratio of 
the total information gathered to the total energy required) of ad hoc sen- 
sor network for detection is examined for two sensor deployment models; 
an infinite area model and and infinite density model. For a fixed sensor 
density, the energy efficiency diminishes to zero at rate ©(area"^/^) as 
the area is increased. On the other hand, non-zero efficiency is possible for 
increasing density depending on the behavior of the physical correlation as 
a function of the link length. 

Index Terms- Neyman-Pearson detection, error exponent, GMRF 

1. INTRODUCTION 

Consider the design of a sensor network for the detection of a 
coffelated stochastic signal in a fixed area. Many questions arise 
in such a design: How do the field correlation and measurement 
signal-to-noise (SNR) affect the detection performance? What is 
the optimal sensor density, i.e., the number of nodes per unit area? 
What is the information and energy trade-off in such a sensor net- 
work with ad hoc routing? To address these issues, several studies 
based on one-dimensional (ID) spatial signal models have been 
conducted (see, e.g., Q] and (2)- However, there is an important 
difference between ID signal models and actual spatial signals. 
Suppose that we take observations from sensors placed equidis- 
tantly along a line transect laid over a given area. The observations 
may then be viewed as samples generated by a one-dimensional 
process and the results from time series analysis could be applied 
to investigate their statistical properties. However, there is no real 
notion of 'signal flow' or dependence direction along the transect 
as there is in a more traditionally obtained time series. For sam- 
ples from sensors deployed over a two-dimensional (2D) area, it 
is necessary to consider the signal dependence in all direction in 
the plane, and as a consequence, answering the above questions 
becomes more difficult. 

To address the above questions in a 2D setting, in this pa- 
per, we consider the detection of 2D Gauss-Markov random fields 
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(GMRFs) using noisy observations. In particular we consider Sen- 
sors ij located on a 2D lattice X. On denoting the (noisy) measure- 
ments of Sensor ij as Yij and adopting a Neyman-Pearson formu- 
lation, we can model the detection problem via null and alternative 
hypotheses given by 

Ho : Yij = Wij ,ij vs. Hi : Yij = Xij + Wij ,ij (1) 

where {Wij} represents independent and identically distributed 
(i.i.d.) A/'(0,(7'^) noise with a known variance a^, and {Xij} is 
a stationary GMRF on the 2D lattice I independent of the mea- 
surement noise {W^ij}. Thus, the observation samples form a 2D 
hidden GMRF under Hi. 




Fig. I. Sensors on a 2D Lattice I: Hidden Markov Structure 
1.1. Summary of Results 

The exact error probability of the detection of the Neyman-Pearson 
test is not available in closed-form in the general correlated case, 
including the hypotheses l[T}. Hence, we invoke the the large devi- 
ations principle and use the error exponent of the detection prob- 
ability (or, more conveniently, its complement, the miss probabil- 
ity) as an alternative performance measure. For a fixed false-alarm 
level, the miss probability Pm decays exponentially as the sample 
size n increases, and the error exponent is defined as the rate of 
exponential decay, i.e.. 



K. 



lim - 

n — *oo 



(2) 



under the given constraint (i.e., the false alarm probability Pf < 
a). The error exponent is a good performance criterion in the large 
sample regime since it allows the designer to estimate the number 
of samples required for a given detection performance. Hence, 
efficient design can be examined through the error exponent for 
large scale sensor networks. 

Here, we adopt the conditional autoregression (CAR) model 
for the signal, and derive a closed-form expression for the error 
exponent K. of the miss probability (which is independent of q) in 
the spectral domain. We do so by exploiting the spectral structure 
of the CAR signal and the relationship between the eigenvalues of 
the block circulant approximation to a block Toeplitz matrix de- 
scribing the 2D correlation structure. In particular, it is shown that 
the error exponent for the detection of 2D hidden GMRF is an ex- 
tension of that in the ID case obtained by Sung et al.(3j- As in the 
ID case, it is shown that i.i.d. (and, thus, uncorrelated) observa- 
tions maximize the error exponent for a given SNR when the SNR 
is high. On the other hand, there is an optimal non-zero degree of 



correlation at low SNR. Interestingly, it is seen that there is a dis- 
continuity in the optimal correlation strength as a function of SNR. 
In the perfectly correlated case, the error exponent is zero as ex- 
pected. For the error exponent as a function of SNR, we will show 
that the error exponent increases as log SNR for a given correlation 
strength at high SNR. 

We consider two asymptotic regimes modelling the sensor de- 
ployment in 2D: an infinite area model with a fixed density and an 
infinite density model with a fixed area. Applying the results, we 
obtain the asymptotic behavior of the energy efficiency, defined as 
the ratio of the total information gathered to the required energy to 
obtain information from the area for an ad hoc network with mini- 
mum hop routing to the fusion center. For the infinite area model, 
the energy efficiency decays to zero with rate 0(area~^''^) as we 
increase the coverage area. For the infinite density model, on the 
other hand, a non-zero efficiency is possible if the decay rate of the 
error exponent (density) as a function of density is slower than 
0(density'^~*'''^^), where S is the propagation constant 5 > 2. 

1.2. Related Work 

The detection of Gauss-Markov processes in Gaussian noise is a 
classical problem. See (4] and references therein. However, most 
work in this area considers only ID signals or time series. A 
closed-form error exponent was obtained and its properties were 
investigated for ID hidden Gauss-Markov random processes ||3]. 
Large deviations analyses were used to examine the issues of op- 
timal sensor density and optimal sampling were examined with a 
ID signal model in IT] and (Z). 

An error exponent was obtained for the detection of 2D GM- 
RFs in ||5), where the sensors are located randomly and the Markov 
graph is based on the nearest neighbor dependency enabling a 
loop-free graph and further analysis. In this work, however, the 
measurement noise was not captured. Our work here focuses on 
the error exponent for the detection of 2D hidden GMRF on a 2D 
infinite lattice, which allows for the consideration of measurement 
noise. In particular we examine the above CAR model and inves- 
tigate of the detection performance with respect (w.r.t.) to various 
design parameters such as correlation strength, measurement SNR, 
sensor density and area. 

2. DATA MODEL 

Definition 1 (GMRF (6)) A random vector X = {Xi , , • • • ,X„) 

£ R" is a Gauss-Markov random field w.r.t. a labelled graph 
Q — {v,£) with mean fi and precision matrix Q > 0, if its prob- 
ability density function is given by 



p{X) = (277) 



-n/2|Q|l/2 



cxp 



-(X-m)^Q(X-m) 



(3) 



and Qim 7^ {I, m} G £ for all I 7^ m. Here, u is the set of 
all nodes {1, 2, • • • ,n} and £ is the set of edges connecting pairs 
of nodes, which represent the conditional dependence structure. 

Note that the mean and the precision matrix fully characterize 
a GMRF. Note also that the covariance matrix Q^^ is completely 
dense in general while the precision matrix Q has nonzero ele- 
ments Qim only when there is an edge between nodes I and m in 
the Markov random field. Hence, when the graph is not fully con- 
nected, the precision matrix is sparse. The 2D indexing scheme 
(i, j) can be properly converted to an ID scheme to apply Defini- 
tion [T] From here on, we use the 2D indexing scheme for conve- 
nience. 

Definition 2 (Stationarity) A 2D GMRF on 2D doubly infinite 
lattice Too is said to be stationary if the mean vector is constant 



and Cov{Xij,Xiiji) = W,{XijXiiji} = c{i — i' ,j — j') for 
some function c(-, •). 

For a 2D stationary GMRF {Xij}, the covariance {-jij } is defined 
as 

7,, = E{X,,yXi,+i^,,+,} = E{XooX,j}, (4) 

which does not depend on i' or j' due to the stationarity. Further, 
the spectral density function of a zero-mean and stationary Gaus- 
sian process {Xij} on Too with covariance 7;^ is defined as 

^ 7ij exp(-t(ia;i + ja;2)), (5) 



where l = and {loi,lu2) G (^""j i"]^. Note that this is a 2D 
extension of the conventional ID discrete-time Fourier transform 
(DTFT). 

Definition 3 (Tiie conditional autoregression (CAR)) A GMRF 
can be specified using a set of full conditional normal distributions 
with mean and precision: 

1 



E{Xij|X_ij} — -— ^ 0i'jiXi^i,j^j,, (6) 



Prec{Xij 



,} 



i'j'eiooT^oo 
000 > 0, 



(7) 



where X_ij denotes the set of all variables except Xij. 
It is shown that the GMRF defined by the CAR model l|6j - (|7]l is 
a zero-mean stationary Gaussian process on Too with the spectral 
density function (6) 

f{oj^, UJ2) = — ^ (8) 



47r2 



if 



^o}| <oo, e,. 



000 > 0, 



(9) 



{ey}issothat/(a;i,a;2) > 0, y{uji,uJ2) G (-tt, tt]^. (10) 

We assume that the 2D stochastic signal in l|T) is given by a sta- 
tionary GMRF defined by the CAR model ^ - ^ and ^ - i fTot . 
Then, the observation spectrum under the two hypotheses l[T) are 
given, respectively, by 



S^{u;i,uj2) 



47r2 



and Sf{uJi,uJ2) 



a 

4^ 



+ f{uJl,U)2). 



3. PERFORMANCE MEASURE: ERROR EXPONENT 

In this section, we investigate the performance of the Neyman- 
Pearson detector with level a G (0, 1) for a 2D CAR signal in 
noisy observations. We obtain the error exponent in the spectral 
domain for this problem by exploiting the spectral structure of the 
CAR signal and the relationship between the eigenvalues of block 
circulant and block Toeplitz matrices representing 2D correlation 
structure. 

Tlieorem 1 (Error Exponent) Consider Ney man- Pearson detec- 
tion between the hypotheses ([TJ with the model ^ - (O and with 
level a G (0, 1). Assuming that conditions ||9| and [70t hold, the 
error exponent of the miss probability is independent of a and is 
given by 



K = 



duJiduJ2, (11) 



2a'^ +AiT'^f{uJi,ui2) 2 J 
where D[-\\-) denotes the Kullback-Leibler divergence. 



Proof: K, is given by the almost-sure limit of the asymptotic 
Kullback-Leibler rate K = lim„_oo ^ log (y„) evaluated under 

po,n(Zl- Using the fact that we have Gaussian distributions under 
both hypotheses, we have 



K = 



lim 



1 dot(i:i , 

— loff ■ 

2 dot(So,, 



Then approximating the block Toeplitz correlation matrix with a 
block circulant matrix and applying the the 2D Grenander-Szego 
theorem, we obtain the limit of each term as follows. 



■ log dct(Si ,„) 



-log dot(So,„) 



1 

47T 



I j log(o-^ + 47r^/(wi, W2))rfi^l<i^2, 

2 



log IT 



n 

almost surely. ■ 

This theorem is a 2D extension of the error exponent of ID 
hidden Gauss-Markov model based on state-space structure ob- 
tained in [3]. Intuitively, the error exponent Jllb can be explained 
using the frequency binning argument. For each 2D frequency seg- 
ment dujiduj2, the spectra are flat, i.e., the signals are independent 
and Stein's lemma can be applied for the segment. The overall 
Kullback-Leibler divergence is the sum of contributions from each 
bin. 

3.1. Symmetric First Order Autoregression 

To investigate the behavior of the error exponent as a function of 
correlation and SNR, we further consider the symmetric first order 
autoregression (SFAR), described by the conditions 



E{Xi,|X_,j} 
Prec{Xij|X_ij} 



A , 



-1,3 



-(Xi+ij 

K 

ft > 0, 

where < A < ^. (This is a sufficient condition to satisfy (|9} - 
UOIl.) Note here that 6*00 = Kand^i.o = S-i.o = do.i ~ ^'o,-i = 
— A. In this model, the correlation is symmetric for each set of four 
neighboring nodes. The SFAR model is a simple yet meaningful 
extension of the ID Gauss-Markov random process, which has the 
conditional causal dependency only on the previous sample. Here 
in the 2D case we have four neighboring nodes in the four (planar) 
directions. The spectrum of the SFAR is given by 



47r^ft;(l — 2("cos(i;i — 2i^costi^2) 



We define the edge dependence factor C, by 



C^-, 0<C<l/4. 

K 



(12) 



(13) 



Note that C = corresponds to the i.i.d. case whereas = 1/4 
corresponds to the perfectly correlated case. Hence, the correlation 
strength can be captured in this single quantity for SFAR signals. 
The power of the SFAR is obtained using the inverse Fourier trans- 
form via the relation (|5}, and is given by 

2X(4C) 



■ 700 



0<C<4 



(14) 



where K[-) is the complete elliptic integral of the first kind (8|. 
The SNR is given by SNR = ^ = 2^^- Using eq. (IB and 
the SNR, we obtain the error exponent in the SFAR signal case, 
denoted by K,s and given in the following corollary. 



Corollary 1 The error exponent for the Neyman-Pearson detector 
for the hypotheses ([7]) with the SFAR 2D signal model is given by 



(2/,r)R'(4C)(l - 2Cc 



SNR 

(2/,)K(4C)(l-2<;co= 



^1— 2C cos (^2) 



Note that the SNR and correlation are separated in l |15K which 
enables us to investigate the effects of each term separately. 

3.2. Properties of the Error Exponent KLa 

First, it is readily seen from Corollary [T] that tCs is a continuous 
function of the edge dependence factor C, (Q < C, < 1/4) for a 
given SNR. The values of K,a at the extreme correlations are given 
by noting that K{Q) — ^ and A'(l) = oo. Therefore, in the 
i.i.d. case (i.e., — 0), the corollary reduces to Stein's lemma as 
expected, and ICs is given by 



Ks = ilog(l + SNR) + — 

2 2(1 + SNR) 2 



D(Ar(0,a^)\\J^(n,<j^+P,)). 



For the perfectly correlated case (^ — 1/4), on the other hand, 
ICs = 0. In fact, in this case as well as in the i.i.d. case, the 
two-dimensionality is irrelevant. The known result that Pm ~ 



e(r 



-1/2n 



for the perfectly correlated case is applicable. 



For intermediate values of correlation, we evaluate dlSt for 
several different SNR values, as shown in Fig. (2] It is seen that at 




5 0.2 0.35 



(c) 



(d) 



Fig. 2. ICs as a function of (: (a) SNR = 10 dB, (b) SNR ■■ 
(c) SNR = -3 dB, (d) SNR = -5 dB 



OdB. 



high SNR ICs is monotonically decreasing as increases. Hence, 
i.i.d. observations give the best error performance for a given value 
of SNR when SNR is large, as in the ID case (3). As we decrease 
the SNR, it is observed that a second mode grows near — 1/4. 
As we further decrease the SNR, the value of of the second mode 
shifts toward 1 /4, and the value of the second mode exceeds that 
of the i.i.d. case. Hence, there is a discontinuity in the optimal 
correlation as a function of SNR in the 2D case even if the maximal 
ICs itself is continuous. This is not the case in ID. 

With regard to ICs as a function of SNR, it is straightforward 
to see that it is continuous and increases at the rate log SNR at high 
SNR for a given value of 



4. AD HOC NETWORKING: INFORMATION-ENERGY 
TRADE-OFF 

The analytical results in the previous section can be applied to an- 
swer some fundamental questions in the design of sensor networks 
for detection applications. We consider a planar ad hoc sensor net- 
work with minimum hop routing. To simplify the analysis, we as- 
sume that (2n + 1)'^ sensors are located on the grid [—n : 1 : 
with spacing r„, as shown in Fig. [T] and a fusion center is located 
at the center (0, 0). We assume that the measurement Yij is de- 
livered to the fusion center using the minimum hop routing, which 
requires a hop count of |i| + | j|. 
4.1. Physical correlation model 

The actual physical correlation in this model can be obtained by 
solving a proper continuous index 2D stochastic differential equa- 
tion (SDE), e.g., 



X{x,y) = u{x,y), 



where u{x, y) is the process noise and ^ is a parameter determin- 
ing the correlation strength of the field. By solving a proper SDE, 
the edge correlation factor p is given, as a function of the edge 
length r„ , by 

P = fi-Tn). 

Typically, /(•) is a positive and monotonically decreasing function 
of Tn- Further, we have a monotone mapping g : p —> from the 
edge correlation factor p to the edge dependence factor which 
maps zero and one to zero and 1/4, respectively. Thus, we have 
= g{f{rn)), and for given physical parameters (with a slight 
abuse of notation), 

/C.(SNR,C) = <^s(SNR,g(/(r„))) =/Cs(SNR,r-„). 

We will use the arguments SNR and ( for ICs properly if necessary. 

4.2. Energy efficiency 

We now consider the energy efficiency of the ad hoc sensor net- 
work as the network size grows. The energy efficiency 77 can be 
defined as 

total gathered information It 
total required energy ht 
where It is given by the product of the number of sensors and 
the information ICs per each sensor. We consider two asymptotic 
regimes for the increase in network size: an infinite area model 
with fixed density and an infinite density model with fixed area. 
The behavior of the energy efficiency as we increase the network 
size is summarized in the following theorems. 

Theorem 2 (Infinite area model) For an ad hoc sensor network 
with increasing area and a fixed node density, the energy efficiency 
decays to zero as we increase the area with rate 



-1/2 



(17) 



Proof: The total energy required for data gathering is given by 

Ti n 

Et = Ei 

ink ij'n ink (^n), 

i——n j=—n 

where the transmission energy per link Eunkir-n) ~ and S is 
the propagation loss factor. We have It — {2n + l)^ICs{rn), and 
area = e(n^). The energy efficiency is given by 

(2n-Hl)2/C^(r„) 



Since r„ is fixed, ICs and Eunk do not change with n, and ( I17t 
follows. ■ 

Theorem 3 (Infinite density model) For the infinite density model, 
a non-zero efficiency is possible if the decay rate of the error expo- 
nent ICs as a function of density is slower than 



O {density'-^-^'"^'^ 



(19) 



2n(n-hl)(2n-hl)Eii„fe(r„)' 



(18) 



Proof: For the infinite density model, we have 

rn = Q{n~^), = 0(ri^*), density = 0(71^). 

From l[T8), we have t] — ICa{r„)/n^~^ . If ICs as a function of r„ 
decays slower than n^^^ , rj does not diminish to zero. ■ 

The non-zero efficiency in the asymptotic regime depends on 
the decay rate of ICs as a function of r „. Since ICs (C) is given, this 
depends on the functions / and g in Section l^Tl and the propaga- 
tion loss factor 5. 

5. CONCLUSIONS 

We have considered the detection of 2D GMRFs from noisy ob- 
servations. We have adopted the CAR model for the signal, and 
have used a spectral domain approach to derive the error expo- 
nent for the Neyman-Pearson detector satisfying a fixed level con- 
straint. Under the symmetric first order autoregressive model, we 
have obtained the error exponent explicitly in terms of the SNR 
and the edge dependence factor. We have investigated the proper- 
ties of the error exponent as a function of SNR and correlation. We 
have seen that the behavior of the error exponent w.r.t. correlation 
strength is divided into two regions depending on SNR. At high 
SNR, i.i.d. (and, thus, uncorrelated) observations maximize the 
error exponent for a given SNR, whereas there is non-zero opti- 
mal value of correlation at low SNR. Further, it has been seen that 
there is a discontinuity for the optimal correlation as a function of 
SNR. Based on the error exponent, we have also investigated the 
energy efficiency of ad hoc sensor network for detection applica- 
tions. For a fixed node density, the energy efficiency decays to zero 
with rate 0(area~^''^) as we increase the area. On the other hand, 
non-zero efficiency is possible with increasing density depending 
on physical correlation strength as a function of the link length. 
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